ABSTRACT

We present new measurements of the redshift-space three-point correlation function (3PCF) of
Luminous Red Galaxies (LRGs) from the Sloan Digital Sky Survey (SDSS). Using the largest dataset
to date, the Data Release 7 (DR7) LRGs, and an improved binning scheme compared to previous
measurements, we measure the LRG 3PCF on large scales up to $\sim 90\,h^{-1}$ Mpc, from the mildly
non-linear to quasi-linear regimes. Comparing the LRG correlations to the dark matter two- and
three-point correlation functions obtained from N-body simulations, we infer linear and non-linear
bias parameters. As expected, LRGs are highly biased tracers of large-scale structure, with a linear
bias $b_1 \sim 2$; the LRGs also have a large positive non-linear bias parameter, in agreement with
predictions of galaxy population models. Using the 3PCF to estimate the bias also allows us to
estimate the cosmological parameter $\sigma_8$ and to infer best-fit Halo Occupation Distribution
parameters for LRGs. We also use a large suite of public mock catalogs to characterize
the error covariance matrix for the 3PCF and compare the variance among simulation results with
jackknife error estimates.

Subject headings: cosmology: large-scale structure of the universe, observations, high-order clustering 



1. INTRODUCTION 

The large-scale structure traced by galaxies is shaped both by the cosmic expansion history,
which determines the gravitational evolution of density perturbations, and by the physics of
galaxy formation. Comparison of galaxy clustering with the predictions of structure formation
models therefore constrains both cosmological and galaxy formation parameters. For a given
clustering observable, e.g., the galaxy two-point correlation function (2PCF), $\xi_{gg}(r)$,
there are typically significant degeneracies between the inferred cosmological and galaxy
formation parameters. Measurement of multiple observables with different relative dependencies
on cosmology and galaxy formation can help break such degeneracies (Abazajian et al. 2005;
Zheng & Weinberg 2007). The galaxy three-point correlation function (3PCF), the next level up
in the hierarchy of N-point correlation functions, encodes information that is complementary
to that contained in the 2PCF and is therefore a useful second observable for constraining
cosmology and galaxy formation (Sefusatti & Scoccimarro 2005; Sefusatti et al. 2006).

The 3PCF has been measured since the advent of the first angular catalogs (Peebles & Groth
1975), and more recently in the last generation of spectroscopic surveys such as 2dFGRS
(Jing & Börner 2004; Gaztañaga et al. 2005) and SDSS (Kayo et al. 2004; Nichol et al. 2006;
Kulkarni et al. 2007; McBride et al. 2010). The goals of these efforts have been mostly to
test predictions from theories of growth of structure and cosmological simulations, as well
as to measure the biasing of the galaxies with respect to the dark matter 3PCF. With the
large volume of current surveys, we are able to improve the signal-to-noise of the
measurements and obtain more information from the 3PCF to help constrain cosmological
and galaxy formation models.

In this paper, we measure the 3PCF for a particular class of galaxies, the Luminous Red
Galaxies (LRGs) (Eisenstein et al. 2001) observed spectroscopically by the Sloan Digital Sky
Survey (SDSS) (York et al. 2000), and use the results to constrain galaxy bias and
cosmological parameters. LRGs offer several advantages for this measurement. At the bright
end of the galaxy luminosity function, LRGs can be targeted out to relatively large
distances, $z \sim 0.5$ (whereas the average redshift is $z \sim 0.3$), in the SDSS
spectroscopic survey. Their clustering can therefore be measured over scales large compared
to those probed by other galaxies in the same survey (Eisenstein et al. 2005; Tegmark et al.
2006; Percival et al. 2007; Gaztañaga et al. 2009b), which are on average at $z \sim 0.1$.
In addition, LRGs are more strongly clustered than less-luminous galaxies, making their
correlations intrinsically easier to measure on large scales, where shot noise is
subdominant.

The physics of galaxy formation is imprinted in the N-point galaxy correlation functions via
the bias, the relation between the spatial distributions of galaxies and dark matter.
Although the Halo Occupation Distribution model (HOD; see for instance Jing et al. 1998;
Seljak 2000; Scoccimarro et al. 2001; Berlind & Weinberg 2002; Cooray & Sheth 2002;
Zheng et al. 2005) has become a popular framework for parametrizing the bias (especially
on small scales), here we focus on using the 2PCF and 3PCF LRG measurements on large scales
to constrain a simpler, non-linear, local bias model that appears to adequately capture the
features displayed by the clustering data. This approach allows us to break the degeneracy
that exists between the linear bias parameter and the r.m.s. of density fluctuations at a
scale of 8 $h^{-1}$ Mpc, $\sigma_8$, when extracting information from the 2PCF, as shown,
using different statistics, in Pan & Szapudi (2005) and Ross et al. (2008). Although a
complete and detailed HOD analysis using fits to the 2-point and 3-point functions will be
presented elsewhere (Sabiu et al. 2010), we explore the idea of using the LRG bias
parameters in constraining HOD models, as suggested by Sefusatti & Scoccimarro (2005).

Fig. 1. — Parametrization of triangles for measuring the three-point correlation function.
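
The degeneracy-breaking argument can be sketched with the leading-order relations commonly quoted for a local, non-linear bias model. This is a standard textbook form stated here as an assumption; the paper's own parametrization is the one discussed in §5, and the symbols $b_2$ and $c_2$ are introduced only for this illustration:

```latex
\xi_g(r) \simeq b_1^2\,\xi_m(r), \qquad
Q_g \simeq \frac{1}{b_1}\left(Q_m + c_2\right), \qquad
c_2 \equiv \frac{b_2}{b_1}.
```

Because $\xi_m \propto \sigma_8^2$ on large scales, the 2PCF amplitude alone fixes only the product $b_1 \sigma_8$. The reduced matter 3PCF $Q_m$ is, at leading order, insensitive to $\sigma_8$, so the shape dependence of $Q_g$ constrains $b_1$ and $c_2$ separately, and combining the two statistics then yields $\sigma_8$.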

We note that Gaztañaga et al. (2009a) have recently measured the SDSS LRG 3PCF on very
large scales, focusing on detection of the baryon acoustic oscillation signature.
Reid & Spergel (2009) used a counts-in-cylinders statistic to constrain the LRG HOD. In
addition, Ross et al. (2008) measured higher-order angular clustering of a sample of
photometrically selected LRGs from the SDSS and used it to constrain non-linear bias
parameters and $\sigma_8$.

Our goal here is to present an up-to-date measurement of the 3PCF, to show another way to
use the 3PCF information on large scales (in this case to calculate the bias), to offer
complementary constraints on $\sigma_8$, and to show that the 3PCF can be used as a
complement in finding best-fit HOD parameters.

The outline of the paper is as follows. In §2 we review the 3PCF and describe our binning
scheme for the 3PCF measurement; we also describe the SDSS LRG sample and the simulations
used to create mock catalogs. In §3 we present our measurements of the 3PCF in the LRG
sample. In §4 we discuss the estimation of 3PCF errors, which are critical in constraining
parameters. In §5 we constrain the non-linear bias parameters of the LRGs and explore the
possibility of constraining the matter power spectrum amplitude $\sigma_8$, as well as
relevant HOD parameters. We summarize and conclude in §6.

2. METHODS, DATA & SIMULATIONS 

2.1. The 3-point correlation function 

The 3PCF describes the probability of finding three objects in a particular triangle
configuration, compared to that of a random sample. The joint probability of finding three
objects in three infinitesimal volumes $dV_1$, $dV_2$, and $dV_3$ is given by (Peebles 1980)

$$P = \left[1 + \xi(r_{12}) + \xi(r_{23}) + \xi(r_{31}) + \zeta(r_{12}, r_{23}, r_{31})\right] \bar{n}^3\, dV_1\, dV_2\, dV_3, \qquad (1)$$

where $\bar{n}$ is the mean density of objects, $\xi$ is the 2PCF, and $\zeta$ is the
(connected) 3PCF:

$$\xi(r_{12}) = \langle \delta(r_1)\,\delta(r_2) \rangle, \qquad (2)$$
$$\zeta(r_{12}, r_{23}, r_{31}) = \langle \delta(r_1)\,\delta(r_2)\,\delta(r_3) \rangle, \qquad (3)$$

where $\delta$ is the fractional overdensity of objects or of the continuous field studied.
The triangle sides $r_{ij}$ are the distances between objects $i$ and $j$ in the triplet.
Since the 3PCF depends upon the configuration of the three sides, it is sensitive to the
shapes of spatial structures (Sefusatti & Scoccimarro 2005; Gaztañaga & Scoccimarro 2005;
Marín et al. 2008).

Since the ratio $\zeta/\xi^2$ is both predicted and found to be close to unity over a large
range of length scales, even though $\xi$ and $\zeta$ each vary by orders of magnitude
(Peebles 1980), it is convenient to define the reduced 3PCF,

$$Q(s, u, \theta) = \frac{\zeta(s, u, \theta)}{\xi(r_{12})\,\xi(r_{23}) + \xi(r_{23})\,\xi(r_{31}) + \xi(r_{31})\,\xi(r_{12})}. \qquad (4)$$

Here, $s = r_{12}$ sets the scale size of the triangle, and the shape parameters are given
by the ratio of two sides of the triangle, $u = r_{23}/r_{12}$, and the angle between those
two sides, $\theta = \cos^{-1}(\hat{r}_{12} \cdot \hat{r}_{23})$, where $\hat{r}_{12}$ and
$\hat{r}_{23}$ are unit vectors in the directions of those sides (see Figure 1). By
measuring the shape- and scale-dependence of $Q(s, u, \theta)$ for galaxies and comparing
with that predicted for dark matter, we can constrain the galaxy bias.
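
As an illustration of the parametrization above, the following minimal Python sketch converts a central configuration $(s, u, \theta)$ into its three side lengths and evaluates the reduced 3PCF of Eq. (4). It assumes that $\theta$ is the angle between the direction vectors of sides $r_{12}$ and $r_{23}$, so that $\theta = 0$ corresponds to a fully stretched triangle with $r_{31} = r_{12} + r_{23}$. The function names are hypothetical and are not part of the NPT code.

```python
import math


def triangle_sides(s, u, theta):
    """Return (r12, r23, r31) for scale s, side ratio u, and angle theta (radians).

    Assumption: theta is the angle between the direction vectors of sides
    r12 and r23, so r31 is the length of their vector sum.
    """
    r12 = s
    r23 = u * s
    r31 = math.sqrt(r12**2 + r23**2 + 2.0 * r12 * r23 * math.cos(theta))
    return r12, r23, r31


def reduced_3pcf(zeta, xi12, xi23, xi31):
    """Reduced 3PCF Q of Eq. (4): zeta divided by the cyclic sum of 2PCF products."""
    return zeta / (xi12 * xi23 + xi23 * xi31 + xi31 * xi12)
```

Under this convention, for $u = 2$ the third side runs from $3s$ at $\theta = 0$ down to $s$ at $\theta = \pi$, so no side of the triangle is shorter than $s$.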

We calculate the 2PCF using the estimator of Landy & Szalay (1993),

$$\xi = \frac{DD - 2DR + RR}{RR}. \qquad (5)$$

Here, $DD$ is the number of data pairs normalized by $N_D \times N_D/2$, $DR$ is the number
of pairs using data and random catalogs normalized by $N_D N_R$, and $RR$ is the number of
random pairs normalized by $N_R \times N_R/2$, where $N_D$ and $N_R$ are the number of
points in the data and in the random catalog, respectively. The 3PCF is calculated using
the Szapudi & Szalay (1998) estimator:

$$\zeta = \frac{DDD - 3DDR + 3DRR - RRR}{RRR}, \qquad (6)$$

where $DDD$, the number of data triplets, is normalized by $N_D^3/6$, and $RRR$, the number
of random triplets, is normalized by $N_R^3/6$. $DDR$ is normalized by $N_D^2 N_R/2$, and
$DRR$ by $N_D N_R^2$. These estimators optimally correct for edge effects by incorporating
pair and triplet counts from a random catalog. To compute the correlation functions, we use
the NPT software developed in collaboration with the Auton Lab at Carnegie Mellon
University. NPT is a fast implementation of N-point correlation function estimation using
multi-resolution kd-trees to compute the number of pairs and triplets in a dataset. For
more details and information on the algorithm, see Moore et al. (2001), Gray et al. (2004),
and Nichol et al. (2006).
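
The two estimators can be sketched in a few lines of Python. This is a minimal illustration, not the NPT implementation: it takes raw counts in a single bin and normalizes them by the exact number of unordered pairs or triplets that can be formed, which is an assumption about the counting convention. For large catalogs these totals scale as the normalizations quoted above, up to convention-dependent constant factors.

```python
import math


def landy_szalay(dd, dr, rr, n_d, n_r):
    """2PCF estimator of Eq. (5) from raw pair counts in one separation bin.

    Assumption: counts are unordered pairs, normalized by the total number
    of unordered pairs available in each catalog combination.
    """
    dd_n = dd / math.comb(n_d, 2)
    dr_n = dr / (n_d * n_r)
    rr_n = rr / math.comb(n_r, 2)
    return (dd_n - 2.0 * dr_n + rr_n) / rr_n


def szapudi_szalay(ddd, ddr, drr, rrr, n_d, n_r):
    """3PCF estimator of Eq. (6) from raw triplet counts in one triangle bin.

    Assumption: counts are unordered triplets, normalized as above.
    """
    ddd_n = ddd / math.comb(n_d, 3)
    ddr_n = ddr / (math.comb(n_d, 2) * n_r)
    drr_n = drr / (n_d * math.comb(n_r, 2))
    rrr_n = rrr / math.comb(n_r, 3)
    return (ddd_n - 3.0 * ddr_n + 3.0 * drr_n - rrr_n) / rrr_n
```

With this normalization, a data set whose counts match the random expectation in every term returns zero for both estimators, which is a convenient sanity check before running on real catalogs.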

2.2. Measuring the 3PCF: Binning 

Since measurements of two- and three-point correlations involve counts of pairs and
triplets of objects, the question arises of how to choose the bins of pair and triplet
separation (Gaztañaga & Scoccimarro 2005; Nichol et al. 2006; Kulkarni et al. 2007;
Marín et al. 2008). Small bins enable measurements with fine resolution in separation and
the possible identification of features in the correlation functions, but at the cost of
large shot noise due to the small numbers of pairs and triplets per bin. Large bins reduce
the shot noise but tend to wash out features of the 3PCF. The optimal bin size is therefore
a compromise and depends on the relative importance of signal-to-noise vs. resolution for
the question at hand.

Fig. 2. — Comparison of two binning schemes for the 3PCF measurement. Left: bins in
$r_{ij}$ used in this paper, with $\Delta r_{ij} = 0.1\,r_{ij}$; Right: bins in $s$, $u$,
$\theta$, with $\Delta s = 0.1 s$, $\Delta u = 0.1$, $\Delta\theta = \pi/100$. Black (solid)
triangles show the central configuration for the bin, with $u = 2$ and
$\theta = 100^\circ$. Blue (long dashed) triangles are the smallest triangles in that bin.
Red (short dashed) triangles are the largest in the bin. For the $r_{ij}$ bins, the largest
and smallest triangles in the bin are similar to the central triangle.

Fig. 3. — Comparison of two binning schemes for the DR3 LRG 3PCF measurement. Reduced 3PCF
of the DR3 LRG sample for $s = 7\,h^{-1}$ Mpc (top) and $s = 10\,h^{-1}$ Mpc (bottom) with
similar resolution but different binning schemes. The blue solid line represents the 3PCF
from the scheme used in Kulkarni et al. (2007; K07 in the plot), and open squares use the
binning scheme adopted in this paper. Diagonal errors were obtained from the jack-knife
resampling method.

For the 3PCF, there is also the question of which separation parameters to bin in, that
is, which triangle configurations to include in each bin. Nichol et al. (2006) and
Kulkarni et al. (2007) measured the 3PCF in bins of $s$, $u$, and $\theta$; that is, they
counted triplets of galaxies that satisfy $s \in s_0 \pm \Delta s$,
$u \in u_0 \pm \Delta u$, $\theta \in \theta_0 \pm \Delta\theta$ for the bin centered on
$(s_0, u_0, \theta_0)$. In their measurement of the LRG 3PCF using the DR3 sample,
Kulkarni et al. (2007) used very fine resolution in $\Delta s = \pm 0.1$ Mpc (1% at 10
$h^{-1}$ Mpc) and $\Delta\theta = \pm\pi/100$, and an intermediate value for
$\Delta u = \pm 0.1$. With such fine resolution, $Q(\theta)$ is measured with low
signal-to-noise (since there are few pairs and triplets in each bin); and since
$\Delta u$ is not small (for $u = 2$, $s = 10$ the bin size of the second side of the
triangle is 12% of the side), some triplets are counted in two or more configurations, or
'bins'. With this binning scheme it is not possible to measure the LRG 3PCF on larger
scales. But simply increasing the bin size in this way would lead to very different
triangles being accepted in the same bin, as shown in the right side of Fig. 2.

In this work, we use the parameters $s$, $u$ and $\theta$ to determine the centers of the
triplet bins, but we construct the bins themselves directly in each of the pair
separations, with $\Delta r_{ij} \propto r_{ij}$, i.e., we use bins of constant
$\Delta \log r_{ij}$ (Marín et al. 2008; McBride et al. 2010b; McBride et al. 2010). As
Fig. 2 shows, this binning scheme groups together triplet configurations that are more
similar in shape (both mathematically and colloquially) and size than do bins in $s$, $u$,
and $\theta$. As a result, we can achieve either higher resolution in triangle shape and
size or higher signal-to-noise for fixed resolution. Also, to obtain a better signal in the
3PCF, we somewhat increase the size of the bins, which leads to a stronger correlation
between the points, and therefore we do not need to measure the 3PCF for as large a number
of angles $\theta$ as Kulkarni et al. (2007) did.

We show in Figure 3 the results of using the different binning schemes: using the DR3 LRG
sample (see §2.4 for a description of the sample), we compare the 3PCF obtained with the
binning scheme of Kulkarni et al. (2007) (green solid lines) with that from the scheme
used in this paper, with $\Delta r_{ij} = \pm 0.1\,r_{ij}$ (open squares). In our binning
scheme $Q(\theta)$ fluctuates much less than with the 'old' scheme, it reproduces the
theoretical predictions better (note that the plateau seen at $\theta > 100^\circ$ in the
$s = 10\,h^{-1}$ Mpc configurations is absent in the new scheme), and it has smaller errors
(calculated, as in Kulkarni et al. (2007), using jack-knife resampling, which will be
discussed in §4).

We take advantage of this binning scheme to measure the 3PCF over a large range of
triangle sizes and shapes: we study central configurations with $s$ ranging from 7
$h^{-1}$ Mpc (which allows us to compare these measurements to previous works) to 30
$h^{-1}$ Mpc (in the quasi-linear regime, in order to estimate bias parameters), with
$u = 2$ and 15 equally spaced values of $\theta$. For other studies (outside the scope of
this paper), such as finding best-fit galaxy population (HOD) parameters from fits to the
3PCF, it is necessary to carry out measurements on smaller scales. The results are limited
by shot noise on small scales (due to the small spatial density of the LRG field) and by
finite volume on large scales.

In Figure 4 we illustrate the sensitivity of the reduced 3PCF measurement to the choice of
binning resolution. We show measurements with $s = 7$ and 15 $h^{-1}$ Mpc for the SDSS
DR7-Dim LRG sample (see §2.4). The blue dotted, solid red, and dashed magenta curves
represent results with $\Delta r_{ij}/r_{ij} = \pm 0.05$, 0.1, and 0.2, respectively. We
can see significant bin-dependent differences in the reduced 3PCF amplitude, particularly
for elongated triangle configurations ($\theta$ close to $0^\circ$ or $180^\circ$). The
differences appear to increase for larger scales. Proceeding from smaller to larger bins,
the configuration dependence of $Q$ is smoothed out, and the measurements for different
values of $\theta$ become more strongly correlated. On the other hand, the signal-to-noise
for each measurement increases with bin width. We choose our binning to balance the need
for a good 3PCF signal with the ability to discern shape-dependent features of the 3PCF.
After experimenting with simulations, we settled on a 14% binning resolution, i.e.
$\Delta r = \pm 0.07\,r$, for all LRG samples we studied (see §2.4).

Fig. 4. — Effects of binning in the 3PCF measurement for the SDSS DR7-Dim LRG sample, for
configurations with $u = 2$. Left: measurements with $s = 7\,h^{-1}$ Mpc. Right:
$s = 15\,h^{-1}$ Mpc. The blue dotted, solid red, and dashed magenta curves represent
results with $\Delta r_{ij}/r_{ij} = \pm 0.05$, 0.1, and 0.2, respectively.
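
The fractional side binning described above can be sketched as follows. This is a minimal Python illustration under stated assumptions: the helper names are hypothetical, the default `frac=0.07` corresponds to the $\Delta r = \pm 0.07\,r$ resolution adopted in the text, and the measured sides are assumed to be already ordered to match $(r_{12}, r_{23}, r_{31})$, whereas a production code would also have to consider permutations of the three sides.

```python
def side_bins(sides, frac=0.07):
    """Return one (low, high) interval per side, with edges at r * (1 -/+ frac)."""
    return [(r * (1.0 - frac), r * (1.0 + frac)) for r in sides]


def in_bin(triplet_sides, bins):
    """True if every measured side lies inside its corresponding interval.

    Assumption: triplet_sides is ordered like the bin centers.
    """
    return all(low <= r < high for r, (low, high) in zip(triplet_sides, bins))
```

Because each interval scales with its own side length, the largest and smallest triangles accepted in a bin are close to scaled copies of the central one, which is the property highlighted in the comparison of binning schemes.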



2.3. The HOD Model

The HOD model (e.g. Jing et al. 1998; Seljak 2000; Scoccimarro et al. 2001;
Berlind & Weinberg 2002; Cooray & Sheth 2002; Zheng et al. 2005) provides a parametrized
prescription for the galaxy spatial distribution. The first component is a model for the
distribution of dark halos. One specifies the halo mass function, $n(M)$, the spatial
correlations of the halos, and the density profiles for halos of mass $M$, all based on
analytical models and fits to N-body simulations (e.g. Press & Schechter 1974;
White & Rees 1978; Sheth et al. 2001; Tinker et al.). The second component is the HOD
itself, a model specifying how galaxies occupy dark halos as a function of halo mass.

The HOD is a parameterized model of the PDF $P(N|M)$, the probability that a halo of mass
$M$ contains $N$ galaxies with specified properties. The mean occupation function,
$\langle N(M) \rangle$, is well modeled when one separates the contribution from central
galaxies, $N_{\rm cen}(M)$, which is roughly a step function in halo mass (for halos above
a certain mass $M_{\rm min}$), and the contribution of the satellite galaxies,
$N_{\rm sat}(M)$, which appears to be well characterized by a power law in halo mass
(Guzik & Seljak 2002; Kravtsov et al. 2004; Zheng et al. 2005), i.e.,
$N_{\rm sat} \sim (M/M_1)^{\alpha}$, where $M_1$ characterizes the mass of halos that host
satellite galaxies, and the exponent $\alpha$ describes the high-mass slope of the
satellite occupation number.

There is not a unique model for the galaxy HOD; in fact there are many functional forms of
$\langle N(M) \rangle$, which also differ in the number of parameters they use. Also, apart
from the mean HOD, one needs to specify the higher moments of $P(N|M)$. The distributions
of $N_{\rm cen}(M)$ and $N_{\rm sat}(M)$ are found to be consistent with nearest-integer
(Bernoulli) and Poisson distributions, respectively (Kravtsov et al. 2004; Zheng et al.
2005), and the satellite galaxies are assumed to be distributed within halos according to
the NFW halo mass profile (Navarro et al. 1997). The HOD parameters can be found using
different statistics of galaxy clustering (e.g. Zehavi et al. 2004, 2005b; Kulkarni et al.
2007; Blake et al. 2008; Reid & Spergel 2009; Zheng et al. 2009; Ross & Brunner 2009).

In the case of LRGs, there is consensus (e.g. Kulkarni et al. 2007; Zheng et al. 2009;
Sabiu et al. 2010 and references therein) that these galaxies form mostly in high-mass
halos with mass $M \sim$ a few times $10^{13}$ to $10^{14}\,h^{-1} M_\odot$, while a few
percent of them (in halos with $M > 10^{14}\,h^{-1} M_\odot$) have to be satellites within
halos in order to produce the 2PCF exhibited on smaller scales (Masjedi et al. 2006); the
exact values differ depending on the statistic and the model of $\langle N(M) \rangle$
used. As we will see below, the HOD model is a powerful way to create mock galaxy catalogs
(§2.5) and is also effective in modeling the non-linear and the large-scale bias (§5.2).
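
The mean occupation function described in this section can be sketched in Python as a sharp central step plus a power-law satellite term. This is only an illustration: the function names are hypothetical, the step is taken to be perfectly sharp, and the choice to place satellites only in halos that already host a central galaxy is an assumption, since the text notes that many functional forms are in use.

```python
def n_cen(mass, m_min):
    """Central occupation: a sharp step at the minimum halo mass m_min."""
    return 1.0 if mass >= m_min else 0.0


def n_sat(mass, m_1, alpha):
    """Satellite occupation: power law (M / M_1)^alpha."""
    return (mass / m_1) ** alpha


def mean_occupation(mass, m_min, m_1, alpha):
    """Mean number of galaxies in a halo of the given mass.

    Assumption: satellites are placed only in halos hosting a central.
    """
    return n_cen(mass, m_min) * (1.0 + n_sat(mass, m_1, alpha))
```

With this form, a halo of mass $M_1$ above the threshold hosts on average one central and one satellite galaxy, which gives a direct interpretation of $M_1$.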

2.4. The SDSS LRG sample 

The SDSS Luminous Red Galaxies, as selected by the algorithm developed by Eisenstein et al.
(2001), are an excellent tracer of dark matter on large scales due to their high
luminosity, which allows us to map them up to $z \sim 0.5$ and over comoving volumes up to
1.5 $(h^{-1}\,{\rm Gpc})^3$. The selection algorithm and the high success rate of the
spectroscopy (average sky completeness of 98%) make the LRG sample nearly volume-limited
up to $z \sim 0.35$ and flux-limited up to $z \sim 0.47$.

We use galaxies drawn from a sample comprising 105,831 spectroscopically selected LRGs,
based on SDSS Data Release 7 (DR7, Abazajian et al. 2009). The LRG selection algorithm is
described in Eisenstein et al. (2001), and the DR7 LRG samples we use are drawn from the
"DR7-Full" catalog described in Kazin et al. (2010), which is publicly available³. The
DR7-Full sample spans the redshift range $0.16 < z < 0.47$ and includes a range of
luminosities in SDSS g-band

3 http://cosmo.nyu.edu/~cak306/SDSS-LRG.html
TABLE 1
SDSS LRG Samples used

Sample          Area             Redshift           Luminosity                Number of galaxies   Average density ($(h^{-1}\,{\rm Mpc})^{-3}$)
DR7-Full^a      7,908 deg$^2$    0.16 < z < 0.47    $-21.2 > M_g > -23.2$     105,831              $6.70 \times 10^{-5}$
DR7-Dim^a       7,189 deg$^2$    0.16 < z < 0.36    $-21.2 > M_g > -23.2$     61,899               $9.40 \times 10^{-5}$
DR7-Bright^a    7,189 deg$^2$    0.16 < z < 0.45    $-21.8 > M_g > -23.2$     30,272               $2.54 \times 10^{-5}$
DR3-Full^b      3,816 deg$^2$    0.16 < z < 0.55    $-21.2 > M_g > -23.2$     50,987               $6.50 \times 10^{-5}$

a from Kazin et al. (2010)
b from Eisenstein et al. (2005)

of $-23.2 < M_g < -21.2$, where the magnitudes have been K-corrected and passively evolved
to $z = 0.3$, near the median redshift of the sample. The sample is nearly volume-limited
for redshifts $z < 0.36$ and drops off due to the flux limit at higher redshift. The sample
covers a sky area of 7908 deg$^2$ (close to 20% of the sky) and includes data from both the
northern and southern Galactic hemispheres.

From this main catalog, two important samples are extracted: a volume-limited sample that
includes all LRGs, which we call "DR7-Dim", with a maximum redshift of $z = 0.36$ and a
median spatial density of $\sim 10^{-4}\,(h^{-1}\,{\rm Mpc})^{-3}$; and a sample of the
bright LRGs, with $M_g < -21.8$, which we call "DR7-Bright", which reaches redshifts up to
$z = 0.44$ and has a low spatial density of
$\sim 2.5 \times 10^{-5}\,(h^{-1}\,{\rm Mpc})^{-3}$. Both of these samples are the same as
the ones presented in Kazin et al. (2010) and use data from the northern cap only, in order
to have more mock catalogs when estimating the errors and covariance of our measurements
(see §2.5 for more details).

To calculate the correlation functions, we use the same DR7-Full random catalogs as in
Kazin et al. (2010), which contain 10 times as many points as the data. These random
catalogs have been built to have the same redshift selection function and the same sky
completeness as the data. To calculate the 3PCF in a reasonable time, on the largest scales
we lower the number of random points; when measuring the 3PCF at $s = 30\,h^{-1}$ Mpc, our
largest scale, our random catalog has 4 times the number of data points.

For comparison, we have also analyzed the earlier SDSS DR3 LRG sample used by
Kulkarni et al. (2007). That sample covers 3816 deg$^2$ and contains 50,967 LRGs spanning
the redshift range $0.15 < z < 0.55$. In this case, we use the data and random catalogs of
Eisenstein et al. (2005), kindly provided by the author.

The information on these samples is summarized in Table 1.

The LRGs, i.e., giant red ellipticals, are associated with massive halos, and they are
highly clustered (see for instance Zehavi et al. 2005a; Masjedi et al. 2006). This is
illustrated in Figure 5, where we show the redshift-space 2PCF of the DR7-Dim and
DR7-Bright LRGs, along with the dark matter redshift-space 2PCF from N-body simulations
with cosmological parameters based on the WMAP5 results (Komatsu et al. 2009; described in
§2.5). Both LRG samples have a higher 2PCF than the dark matter, and on large scales the
offset appears constant, indicating that the LRGs are a biased tracer of dark matter on
large scales. The DR7-Bright LRGs have a higher 2PCF than the DR7-Dim LRGs, suggesting
that they are associated with more massive halos (Zheng et al. 2009).

Fig. 5. — The redshift-space 2PCF for LRGs. Red open squares correspond to measurements in
the DR7-Dim LRG sample; magenta stars represent the 2PCF of the DR7-Bright LRG sample. The
thick black solid line corresponds to the dark matter 2PCF, from N-body simulations by
Sabiu et al. (2010). Errors are from the variance of LasDamas LRG mock catalogs.

2.5. Simulations & Mock catalogs 

To help analyze and interpret the 3PCF results, we 
use mock catalogs drawn from N-body simulations. The 
mock catalogs serve two functions: (i) to measure the 
3PCF of dark matter, with which the galaxy 3PCF mea- 
surements can be compared to infer the bias, and (ii) to 
determine the expected error covariance matrix for the 
3PCF from the variance among a large number of mock 
catalogs. We use different simulations tailored to these 
different functions. 

First, to study the dark matter two- and three-point correlation functions, we use six
N-body simulations carried out by Sabiu et al. (2010), kindly provided by the author, using
the GADGET code. The cosmological model is a spatially flat $\Lambda$CDM universe, with
matter density parameter $\Omega_m = 0.27$, baryon density $\Omega_b = 0.045$, power
spectrum amplitude $\sigma_8 = 0.8$, reduced Hubble parameter $h = 0.7$, and initial power
spectrum index $n_s = 0.95$. The simulation box volume is 1 $(h^{-1}\,{\rm Gpc})^3$ and
contains $512^3$ dark matter particles with a particle mass of
$5 \times 10^{11}\,h^{-1} M_\odot$ and a spatial softening of 60 $h^{-1}$ kpc. We use the
output at $z = 0.3$, the mean redshift of the LRGs, to compare their correlation functions.

Since using all the dark matter particles for the 3PCF calculations would take a
prohibitively long time, we randomly select particles from the simulation, to a degree at
which we are sure that the dilution does not affect the results. For the 3PCF measurement,
this depends on scale, and we use between 5% and 0.5% of the particles (a higher fraction
for smaller scales). To obtain the spatial coordinates in the redshift-space dark-matter
catalog, we use the long-distance approximation in the z-direction (see e.g.
Bernardeau et al. 2002) to account for the effect of peculiar velocities on the measurement
of the correlation functions.

Fig. 6. — The redshift-space, reduced LRG 3PCF, $Q(\theta)$, for configurations with $u = 2$ and $s = 7$ (left), 10 (middle), and 15 (right)
LRG sample. Magenta long-dashed line: DR7-Bright LRG sample. Blue dotted line: DR3-Full LRG sample. Thick black line: Dark
Matter 3PCF from simulations. Error bars from the variance of LasDamas LRG mock catalogs.
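
The mapping to redshift space along a fixed line of sight can be sketched as below. This is a minimal Python illustration with several assumptions that are not taken from the text: the function names are hypothetical, `v_z` is the physical peculiar velocity in km/s (simulation snapshots may store velocities in other internal units), the Hubble rate is that of a flat $\Lambda$CDM model with $H_0 = 100\,h$ km/s/Mpc, and positions are wrapped periodically in the box.

```python
import math


def hubble_e(redshift, omega_m):
    """E(z) = H(z) / H0 for a flat LCDM model, neglecting radiation."""
    return math.sqrt(omega_m * (1.0 + redshift) ** 3 + (1.0 - omega_m))


def to_redshift_space(z_pos, v_z, redshift, omega_m, box_size):
    """Shift a comoving z-coordinate (h^-1 Mpc) by the line-of-sight velocity.

    Assumptions: v_z is the physical peculiar velocity in km/s, so the
    comoving displacement is v_z * (1 + z) / H(z) with H0 = 100 h km/s/Mpc;
    the result is wrapped periodically into [0, box_size).
    """
    shift = v_z * (1.0 + redshift) / (100.0 * hubble_e(redshift, omega_m))
    return (z_pos + shift) % box_size
```

Only the coordinate along the chosen axis is changed; the two transverse coordinates keep their real-space values.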

We use the dark matter halo catalogs obtained from these simulations to study redshift
distortions in the estimation of the bias parameters (§5.2 below). The halos are found
using a Friends-of-Friends algorithm (Davis et al. 1985) with a linking length of $b = 0.2$
times the mean inter-particle separation. To populate halos with mock LRGs, we use the
best-fit HOD parameters from fits to the projected 2PCF by Zheng et al. (2009) in their
5-parameter model. We assign mock galaxy positions (based on halo profiles) and peculiar
velocities (based on a Gaussian distribution that depends on the mass of the host halo) to
galaxies inside halos using a code kindly provided by Jeremy Tinker.

Second, to estimate the errors in the 3PCF estimates, we use the covariance among public
mock catalogs from the LasDamas simulations (McBride et al. 2010b)⁴. As will be discussed
in §4, we prefer this error estimation method to the use of jack-knife (JK) resampling. For
the LasDamas simulations, the cosmological model is similar but not identical to that
above: flat $\Lambda$CDM with $\Omega_m = 0.25$, $\Omega_b = 0.04$, $\sigma_8 = 0.8$,
$h = 0.7$, and $n_s = 1$. For this set of parameters, the LasDamas team has carried out
N-body simulations in boxes of different sizes to model galaxies of different luminosities;
for the LRGs, they use 40 boxes with a volume of $2400^3\,(h^{-1}\,{\rm Mpc})^3$, each one
containing $1280^3$ dark matter particles with a particle mass of
$4.57 \times 10^{11}\,h^{-1} M_\odot$.
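
The quoted particle masses follow from the box size, particle number, and matter density. The short Python sketch below makes the arithmetic explicit; the function name is hypothetical and the critical density constant is the standard value $\rho_{\rm crit} \approx 2.775 \times 10^{11}\,h^2 M_\odot\,{\rm Mpc}^{-3}$, rounded to four significant figures.

```python
RHO_CRIT = 2.775e11  # critical density today, in h^2 M_sun Mpc^-3 (rounded)


def particle_mass(omega_m, box_size, n_side):
    """Particle mass in h^-1 M_sun.

    box_size is the comoving box side in h^-1 Mpc and the simulation
    contains n_side**3 equal-mass particles tracing all the matter.
    """
    return omega_m * RHO_CRIT * box_size**3 / n_side**3
```

For the LasDamas LRG boxes, `particle_mass(0.25, 2400.0, 1280)` gives about $4.57 \times 10^{11}\,h^{-1} M_\odot$, matching the value above; for the $1\,(h^{-1}\,{\rm Gpc})^3$ boxes with $512^3$ particles and $\Omega_m = 0.27$ it gives about $5.6 \times 10^{11}\,h^{-1} M_\odot$, of the same order as the rounded figure quoted for those simulations.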

These simulations are used to construct mock galaxy catalogs by placing galaxies in dark
matter halos using a Halo Occupation Distribution (HOD) model (Berlind & Weinberg 2002),
with HOD parameters fit from the observed SDSS LRG 2-point clustering. From this main set
they create two types of galaxy mock catalogs, one that resembles the LRG DR7-Dim catalog,
with $z_{\rm max} = 0.36$, and one that resembles the Bright LRG sample, with
$z_{\rm max} = 0.44$. The mocks reproduce the SDSS DR7 geometry, and they include
redshift-space distortions from peculiar velocities. These galaxy mocks have been shown to
reproduce well the form of the DR7 LRG 2PCF on large scales (Kazin et al. 2010), and we
show below that they are in general agreement with the 3PCF measurements as well. Since our
DR7-Dim and DR7-Bright samples use only the northern cap of the SDSS, it is possible to
extract four mock LRG catalogs from each LasDamas box; in this way, we have 160 mock
catalogs for each sample.

4 http://lss.phy.vanderbilt.edu/lasdamas/
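
Estimating an error covariance matrix from a set of mock measurements can be sketched as follows. This is a minimal Python illustration, assuming each mock yields one vector of reduced 3PCF values (one entry per $\theta$ bin) and using the unbiased $1/(N-1)$ normalization, which is an assumption; the function name is hypothetical.

```python
def covariance_matrix(samples):
    """Sample covariance of equal-length measurement vectors, one per mock.

    Assumption: unbiased normalization by (number of mocks - 1).
    """
    n = len(samples)
    dim = len(samples[0])
    mean = [sum(s[i] for s in samples) / n for i in range(dim)]
    cov = [[0.0] * dim for _ in range(dim)]
    for s in samples:
        for i in range(dim):
            for j in range(dim):
                cov[i][j] += (s[i] - mean[i]) * (s[j] - mean[j])
    return [[c / (n - 1) for c in row] for row in cov]
```

The square roots of the diagonal entries give the diagonal error bars, while the off-diagonal entries capture the correlations between $\theta$ bins that grow as the bins are widened.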

3. RESULTS 

We present the redshift-space LRG 3PCF measurements in Figures 6 and 7 for configurations
with $s$ between 7 and 30 $h^{-1}$ Mpc, with $u = r_{23}/r_{12} = 2$. We chose these
scales, which go from the mildly non-linear to the quasi-linear regime, in order to
calculate the large-scale bias of LRGs, and we use only triangles with $u = 2$ since that
puts all triplet distances (i.e. the sides of the triangle) at or above the selected $s$.
If we used $u = 1$, then our most collapsed triangles would have been in the strongly
non-linear regime, affecting our bias estimates. We could also try triangles with $u = 3$
or more, but then our biggest distances would be comparable to the scale of the survey,
and computing the 3PCF would also become prohibitive. We first discuss the measurements on
intermediate and large scales and then explore the 3PCF error estimates.

3.1. LRG 3PCF on intermediate scales 

Figure 6 shows the reduced 3PCF of the DR7 LRG samples on intermediate scales, for $s = 7$,
10, and 15 $h^{-1}$ Mpc. For clarity, we show diagonal error bars only for the DR7-Dim
sample, calculated using the 3PCF variance between the LasDamas mocks. For comparison, we
also show the 3PCF for the DR3 LRG sample. The solid black curves show the 3PCF of the dark
matter from the N-body simulations.

The reduced 3PCF shows the general shape dependence expected from gravitational
instability theory (Bernardeau et al. 2002): the amplitude is higher for elongated
triangle configurations ($\theta \approx 0^\circ, 180^\circ$), reflecting anisotropic
velocity flows along density gradients. The configuration dependence is less pronounced on
smaller scales, where quasi-virialized, more isotropic flows and structures dominate. Since
these measurements are in redshift space, the configuration dependence also reflects the
effects of redshift distortions due to peculiar velocities.

Since LRGs are biased tracers of the dark matter, the 
3PCF of the LRGs differs from that predicted for the 
dark matter. On small scales, the reduced 3PCF ampli- 
tude of the LRGs is smaller by a nearly constant factor 
from that of the dark matter. On larger scales, the nature 
of the 3PCF bias changes: there is a shape dependence 
in the offset between the LRG and dark matter ampli- 
tudes, with the galaxy 3PCF showing less configuration 
dependence than that of the dark matter. 

Comparing the DR7-Full results to the DR3 measurements, which cover half the volume of the DR7 sample, we find general consistency in the 3PCF amplitude on these intermediate scales. For $s = 15$ $h^{-1}$ Mpc, the DR3 amplitude is slightly lower than that of the DR7 sample for all configurations; this is a consequence of the smaller volume sampled to estimate the correlation functions: as observed in other studies (see Marin et al. 2008, and refs. therein), the 3PCF is more sensitive to the sample volume than the 2PCF.

In general, there are no important differences between the DR7-Full and the volume-limited DR7-Dim sample results. There are differences between these and the DR7-Bright sample: the reduced 3PCF for the bright LRGs fluctuates much more, and it deviates from the DR7-Dim sample results for the elongated triangles (large $\theta$) for the $s$ = 7 and 15 $h^{-1}$ Mpc configurations, and for the collapsed triangles (small $\theta$) for the $s$ = 10 $h^{-1}$ Mpc configurations. Since the binning used for the DR7-Bright measurements was tailored to the DR7-Dim correlations, and DR7-Bright is a much less dense sample, we expect more fluctuations due to poorer statistics and the effects of large structures (Nichol et al. 2006); we explore the variance of bright LRGs in §3.3 below.

3.2. LRG 3PCF on large scales 

In Figure 7 we show the reduced 3PCF for LRGs on large scales, for $s$ = 20 and 30 $h^{-1}$ Mpc, with $u = 2.0$. As expected from second-order perturbation theory, the reduced 3PCF shows a stronger configuration dependence on these scales compared to smaller scales, with a dramatic increase in the difference in amplitude between elongated and rectangular configurations at $s \geq 20$ $h^{-1}$ Mpc. That behavior is seen for both the dark matter and the LRG samples. For $s = 30$ $h^{-1}$ Mpc, the dependence of the LRG 3PCF on $\theta$ is more asymmetric between collapsed and elongated configurations, as also happens with the dark matter 3PCF. The errors are greatly increased as well, and there are strong anticorrelations, i.e., negative values of $Q(\theta)$, for the rectangular configurations; this is again a consequence of the filamentary shape of the large-scale structure.

The large-scale 3PCF of the DR7-Bright galaxies is similar to the DR7-Dim 3PCF, with less significant fluctuations on larger scales. On these scales the finite-volume effects can be observed more clearly in the 3PCF: comparing the DR7-Full and DR7-Dim LRG samples, we can see that in general they agree within the error bars, and the small differences are due to a combination of finite-volume effects and biasing effects (the DR7-Full sample includes brighter galaxies at large redshifts, which have a different clustering bias than the DR7-Dim sample). Comparing the DR3-Full results to DR7-Full and even DR7-Dim, we can see that the DR3-Full 3PCF deviates from the DR7 samples on the elongated configurations (larger scales).

3.3. Comparison with LasDamas mocks 

In Figure 8 we compare the 3PCF measurements for the DR7 LRGs, as shown previously in §3.1 and §3.2, to the ones obtained from the LasDamas mocks, with dashed lines for redshift-space measurements and solid lines for real-space measurements. Here we present a representative set of values, with $s$ = 7, 15, and 30 $h^{-1}$ Mpc. In the top panels we show the comparison for DR7-Dim galaxies; in general the agreement is good, and usually within the 1$\sigma$ error bars.

Fig. 8. — Comparison of the LRG 3PCF with LasDamas mock measurements. Symbols represent DR7 measurements, solid lines represent the mean of the 3PCF measured in the real-space LasDamas catalogs, and dashed lines represent the mean of the 3PCF measured on redshift-space mocks. Error bars represent the variance of the redshift-space 3PCF measured in 160 mocks.

In the lower panels, we compare the DR7-Bright galaxies and the mock 3PCFs. The variance of the bright mocks is larger than that of the dim ones, and the agreement between the mocks and the data is not as good, especially on smaller scales for the elongated configurations. Although a lower signal-to-noise and more fluctuations in $Q(\theta)$ are expected because of the low density of this sample, the variance from the mocks should reflect this, and that is not the case on the smaller scales. Since the mocks are built to fit the measured 2-point statistics, they may miss the higher-order statistics; future modeling has to take these statistics into account.

Note also the differences between the real and redshift space measurements in the LasDamas mocks. Both have a dependence on shape, but on small scales the real space 3PCF has a more pronounced shape dependence. On larger scales, especially around $s = 15$ $h^{-1}$ Mpc, they are very similar within the errors. Relative to the differences found by Marin et al. (2008), there are two reasons for this behavior. First, the LRGs are galaxies that inhabit massive halos (e.g., Kulkarni et al. 2007; Zheng et al. 2009, among others), which are in the center of the gravitational potential wells of the large matter structures, and therefore they are less affected by peculiar velocities. Second, the 3PCF on larger scales is much more affected by the morphology of the structures than by the peculiar velocities, and therefore on the largest scales the distortions of the reduced 3PCF will be smaller than on the small scales. There is also a resolution (or binning) component to this; with very thin bins we might be able to better detect the distortions on large scales. We will use this similarity between the real and redshift space 3PCF below in §5.2 to estimate real-space bias parameters from the 3PCF.

4. 3PCF ERROR ESTIMATES 

Two methods have been commonly employed to estimate errors in clustering measurements: the variance among mock samples and jack-knife (JK) sampling. Zehavi et al. (2005b) showed that the JK method can be reliable for obtaining the covariance matrix in the 2PCF, comparing with mock catalogs from independent realizations. In principle, the errors on the 3PCF depend on higher-order correlations up to sixth order (see, e.g., Szapudi 2005) and could be calculated analytically once the latter are measured. In practice, including edge
effects and shot noise contributions makes the analytic 
computation of these errors a complicated and compu- 
tationally challenging problem. The jack-knife, in which 
one sequentially removes subvolumes and computes the 
variance among 3PCF measurements for the remaining 
volumes, provides a convenient computational short-cut, 
but there is no fundamental basis for assuming it is ac- 
curate in this context. Indeed, the jack-knife method as- 
sumes that the removed subvolumes are independent, ig- 
noring density perturbation modes on scales larger than 
those of the subvolumes. The variance in 3PCF estimates 
among a large number of simulated realizations of a given 
model (mock samples) provides a ground-truth measure 
of the 3PCF error for that model. If the model provides 
an accurate representation of the actual galaxy cluster- 
ing, then the mock sample variance should provide the 
best estimate of the errors. In the following, we compare the mock-variance estimates of 3PCF errors, computed from the 160 LRG-Dim mocks from the LasDamas simulations, to the jack-knife variance estimates from a representative LasDamas LRG-Dim mock. For the jack-knife, we divide the mock catalog into many angular regions with equal numbers of galaxies (in the limit of a large number of subsamples, this is equivalent to having regions of equal volume); each JK subsample is built by removing one of these regions from the LRG mock.

4.1. Diagonal Errors 

For the jack-knife and variance methods, the diagonal errors are given by

$$\sigma_Q^2 = \frac{P}{N} \sum_{i=1}^{N} (Q_i - \bar{Q})^2 . \tag{7}$$

For the jack-knife method, $N = N_{JK}$, the number of subvolumes, and $P_{JK} = N_{JK} - 1$; for the variance among mocks, $N$ is the number of independent realizations, and $P_{var} = 1$.
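As a minimal sketch of the diagonal error estimate, the function below evaluates the scatter of a list of $Q$ measurements about their mean with the two prefactors described above: $(N-1)/N$ for jack-knife subsamples and $1/N$ for independent mocks. The function name and the `method` labels are hypothetical, chosen only for this illustration.

```python
import math

def diagonal_error(q_values, method):
    """Standard deviation of Q for one configuration.

    q_values: Q measured in each jack-knife subsample or each mock.
    method:   "jackknife" uses the prefactor (N - 1) / N,
              "mocks" uses the prefactor 1 / N.
    """
    n = len(q_values)
    if n < 2:
        raise ValueError("need at least two samples")
    mean = sum(q_values) / n
    sum_sq = sum((q - mean) ** 2 for q in q_values)
    if method == "jackknife":
        prefactor = (n - 1) / n
    elif method == "mocks":
        prefactor = 1.0 / n
    else:
        raise ValueError("method must be 'jackknife' or 'mocks'")
    return math.sqrt(prefactor * sum_sq)

if __name__ == "__main__":
    q = [0.9, 1.1, 1.0, 1.2, 0.8]  # made-up values for illustration
    print(diagonal_error(q, "mocks"), diagonal_error(q, "jackknife"))
```

For the same list of values the jack-knife error is larger by a factor $\sqrt{N-1}$, which compensates for the fact that jack-knife subsamples share most of their volume and therefore scatter much less than independent realizations.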

To obtain a qualitative estimate of the differences in the diagonal errors between the two methods, we calculate, for each method, $\bar{Q}(s = 15, u = 2, \theta)$, the mean of $Q(\theta)$ for triangles with $s = 15.0$ $h^{-1}$ Mpc and $u = 2.0$, as a function of the number $N$ of samples. We do this for three values of $\theta$: one corresponding to an almost collapsed triangle ($\theta = 18^\circ$), another for a rectangular configuration ($\theta = 90^\circ$), and one for an almost completely extended configuration ($\theta = 160^\circ$). We chose this scale since the 3PCF here is well behaved and does not have large diagonal errors, but the scale is also large enough to make visible the effects of the JK resampling in the 3PCF (i.e., to test the 'independent regions' hypothesis).

The results are shown in Figure 9. In the left panel, we compare $\bar{Q}(s = 15, u = 2, \theta)$ for mocks and for JK subsamples. The average values converge very fast with $N$, and the fact that the means of the jack-knife samples and the mocks are close to each other (differences at the level of 5% on these scales) is a good sign that we can use the mocks for the covariance measurements. In the right panel we show the standard deviation of $Q(s, u, \theta)$ as a function of $N$ for the different methods. We notice that while the variance among the mocks seems to converge at large $N$, the JK errors tend to increase.

4.2. Covariance matrices 

For N-point correlation functions, measurements for different configurations are strongly correlated; therefore the off-diagonal errors must be included in model fitting and parameter estimation. Figure 10 shows the correlation matrices of the 3PCF for configurations with $s = 15$ $h^{-1}$ Mpc, $u = 2$, and varying $\theta$, comparing the jack-knife (top row) and mock-covariance methods (middle row), for different numbers of jack-knife subvolumes and mock realizations. For both methods, the off-diagonal correlation coefficients increase with increasing $N$, which goes from 10 to 160 subsamples/realizations.

For these configurations, the off-diagonal terms tend to converge when $N > 40$. As can be seen in the bottom row of Figure 10, where we show the ratio between the off-diagonal terms of the covariance matrices, $|C_{ij}^{mocks}(N)/C_{ij}^{JK}(N)|$, both methods yield similar results except for the terms relating very collapsed and very elongated configurations, where the jack-knife estimates have higher correlations. This enhanced covariance for the JK method is due to the fact that it uses only one 'realization'; while on small scales the regions can be effectively independent, on large scales (i.e., when examining large triangles) these regions are correlated, and then the JK assumption breaks down.
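The correlation matrices discussed above can be built from a set of realizations as in the following sketch, which assumes each realization is stored as a list of $Q$ values over the $\theta$ bins. It uses the mock-variance normalization $1/N$; for jack-knife subsamples the covariance would carry the extra factor $N - 1$, which cancels in the correlation coefficients. All names are illustrative.

```python
import math

def covariance_matrix(samples):
    """Covariance of Q between angular bins, from N realizations.

    samples: list of N realizations, each a list of Q values (one per bin).
    Uses the 1/N normalization appropriate for independent mocks.
    """
    n = len(samples)
    n_bins = len(samples[0])
    means = [sum(row[k] for row in samples) / n for k in range(n_bins)]
    return [
        [
            sum((row[i] - means[i]) * (row[j] - means[j]) for row in samples) / n
            for j in range(n_bins)
        ]
        for i in range(n_bins)
    ]

def correlation_matrix(cov):
    """Normalize a covariance matrix by its diagonal: r_ij = C_ij / (s_i s_j)."""
    sigma = [math.sqrt(cov[i][i]) for i in range(len(cov))]
    return [
        [cov[i][j] / (sigma[i] * sigma[j]) for j in range(len(cov))]
        for i in range(len(cov))
    ]

if __name__ == "__main__":
    # Three made-up realizations with two angular bins each.
    mocks = [[1.0, 0.4], [1.2, 0.5], [0.9, 0.2]]
    for row in correlation_matrix(covariance_matrix(mocks)):
        print(["%.3f" % x for x in row])
```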

Even though the covariance matrix converges for the JK method at large $N$, and we would be able to extract more information from the covariance matrix, we would be introducing artificial correlations, since the larger $N$ is, the smaller and "less independent" the volumes of the subsamples are. In our analysis of large-scale clustering (for bias and cosmological parameters) we will use the 3PCF error covariance matrices from the variation among the 160 mock catalogs.

5. LRG BIAS 

In the following we quantify the differences between 
dark matter and LRG clustering, known as galaxy bias- 
ing. As we have seen above, there are important differ- 
ences between them, and, in a similar fashion to what 
can be done from 2-point correlations, we would like to 
measure the bias of the LRG 3PCF to obtain cosmo- 
logical information. One way to relate dark matter and 
galaxy clustering in real space is to adopt a deterministic and local bias model (e.g., Fry & Gaztañaga 1993; Frieman & Gaztañaga 1994).

matrix element $i,j$ corresponds to the correlation coefficient of $Q(\theta = \pi/30 + \pi (i - 1)/15)$ and $Q(\theta = \pi/30 + \pi (j$



$$\delta_{gal} = f(\delta_{dm}) = b_1 \delta_{dm} + \frac{b_2}{2} \delta_{dm}^2 + \ldots \tag{8}$$

where $\delta_{gal}$ and $\delta_{dm}$ are the local galaxy and dark matter overdensities smoothed over some scale $R$. To leading order, this bias prescription leads to a relation between the galaxy and dark matter reduced 2PCF and connected 3PCF amplitudes,

$$\xi_{gal}(r) \approx b_1^2 \, \xi_{dm}(r) , \tag{9}$$

$$\zeta_{gal}(r_{12}, r_{23}, r_{31}) \approx b_1^3 \, \zeta_{dm}(r_{12}, r_{23}, r_{31}) + b_2 b_1^2 \left[ \xi_{dm}(r_{12}) \, \xi_{dm}(r_{23}) + \mathrm{perm.} \right] . \tag{10}$$

[Figure 11. Panel legend: LRG DR7-Dim; $s$ = 15+20+30 $h^{-1}$ Mpc and $s$ = 10+15+20+30 $h^{-1}$ Mpc.]

Fig. 11. — Best-fit marginalized likelihood of bias parameters and $\sigma_8$ from the 2PCF and 3PCF using the LRG DR7-Dim sample. On the top plots, contours represent $\Delta\chi^2$ = 1.0, 2.3 and 6.2. On the bottom plots, symbols show the LRG DR7-Dim 3PCF and lines represent biased dark matter 3PCF values; see best-fit parameters in Table 3.

In addition to the bias, the 3PCF can help break the degeneracy between the bias and the r.m.s. variance in the density of spheres with radius $R = 8$ $h^{-1}$ Mpc, $\sigma_8$. This parameter is related to the overall amplitude of the dark matter 2PCF. If we use the 2PCF in equation (9), adopting a fiducial $\sigma_8^{fid}$, then there is a degeneracy between $\sigma_8$ and $b_1$ in linear theory (Pan & Szapudi 2005):

$$\xi_{gal} = b_1^2 \left( \frac{\sigma_8}{\sigma_8^{fid}} \right)^2 \xi_{dm}^{fid} . \tag{11}$$

For the 3PCF, there is a factor $(\sigma_8/\sigma_8^{fid})^4$ that multiplies the right side of eq. (10). To find the best fit for the bias parameters and $\sigma_8$, we use dark matter correlations in a particular fiducial cosmology, and we use equation (11) for the 2PCF fit; for the reduced 3PCF, this prescription reads

$$Q_{gal} = \frac{1}{c_1} \left( Q_{dm} + c_2 \right) , \tag{12}$$

where $c_1 = b_1$ and $c_2 = b_2/b_1$, and where the dependence on $\sigma_8$ cancels, thus allowing us to break the degeneracy between $b_1$ and $\sigma_8$ from the 2PCF. We have to remember, however, that these approximations and expansions are applicable only on large scales and in real space, whereas our measurements are made in redshift space, where correlations are distorted due to peculiar velocities. Therefore, we have to test the validity of these formulas, and try to quantify the effects of these distortions on the scales we are using for these comparisons. We will test how well this bias prescription captures the clustering statistics by fitting these relations to the dark matter and galaxy 2- and 3-point correlation functions.
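The two model relations used in the fits, the 2PCF scaling $\xi_{gal} = c_1^2 (\sigma_8/\sigma_8^{fid})^2 \xi_{dm}$ and the reduced 3PCF relation $Q_{gal} = (Q_{dm} + c_2)/c_1$, can be sketched as follows. The default fiducial value of 0.8 and the function names are assumptions made for this illustration.

```python
def xi_gal_model(xi_dm_fid, c1, sigma8, sigma8_fid=0.8):
    """Galaxy 2PCF from the fiducial dark matter 2PCF.

    sigma8_fid = 0.8 is an assumed fiducial value for this example.
    """
    return c1 ** 2 * (sigma8 / sigma8_fid) ** 2 * xi_dm_fid

def q_gal_model(q_dm, c1, c2):
    """Galaxy reduced 3PCF; independent of sigma8 at this order."""
    return (q_dm + c2) / c1

if __name__ == "__main__":
    # The 2PCF alone cannot separate c1 from sigma8: both calls agree.
    print(xi_gal_model(0.1, c1=2.0, sigma8=0.8))
    print(xi_gal_model(0.1, c1=1.6, sigma8=1.0))
    # The reduced 3PCF responds to c1 directly, whatever sigma8 is.
    print(q_gal_model(1.0, c1=2.0, c2=0.8))
    print(q_gal_model(1.0, c1=1.6, c2=0.8))
```

Any pair with the same product $c_1 \sigma_8$ gives the same 2PCF, while the predicted $Q_{gal}$ differs, which is why adding the 3PCF breaks the degeneracy.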



5.1. Constraints on bias parameters and $\sigma_8$ in redshift space

We compare the dark matter (from the N-body simulations described in §2.3) and LRG 2PCF and 3PCF in order to constrain bias parameters using the relations (11) and (12), in this case assuming these relations are valid in redshift space, i.e., $\xi^{r\text{-space}} \rightarrow \xi^{z\text{-space}}$ and $Q^{r\text{-space}} \rightarrow Q^{z\text{-space}}$. We use all configurations with $s \geq 10$ $h^{-1}$ Mpc, and with $s \geq 15$ $h^{-1}$ Mpc, with $u = 2$, to measure the joint likelihood of the bias parameters and $\sigma_8$.

Fig. 12. — Best-fit marginalized likelihood of bias parameters and $\sigma_8$ from the 2PCF and 3PCF using the LRG DR7-Bright sample. On the top plots, contours represent $\Delta\chi^2$ = 1.0, 2.3 and 6.2. On the bottom plots, symbols show the LRG DR7-Bright 3PCF and lines represent biased dark matter 3PCF values; see best-fit parameters in Table 3.

As described by Gaztañaga & Scoccimarro (2005) and used in Gaztañaga et al. (2005) and Marin et al. (2008), we minimize

$$\chi^2 = \sum_{i=1}^{2N_b} \sum_{j=1}^{2N_b} \Delta_i \left( C_{SVD}^{-1} \right)_{ij} \Delta_j , \tag{13}$$

where $N_b$ is the number of configurations used ($N_b = 60$ for triangles with $s \geq 10$ $h^{-1}$ Mpc and $N_b = 45$ for triangles with $s \geq 15$ $h^{-1}$ Mpc). Since we use both 2- and 3-point correlations we have

$$\Delta_i = \left( \xi(r_3)_i^{obs} - \xi(r_3)_i^{model} \right) / \sigma_{\xi(r_3),i} , \quad \mathrm{for}\ i \leq N_b \tag{14}$$

$$\Delta_i = \left( Q_i^{obs} - Q_i^{model} \right) / \sigma_{Q(i)} , \quad \mathrm{for}\ i > N_b \tag{15}$$

where $\xi(r_3)^{model}$ and $Q^{model}$ are given by eqs. (11) and (12) for the galaxies. The matrix $C_{ij,SVD}$ is the normalized covariance matrix, recalculated with the highest modes from a Singular Value Decomposition. This covariance matrix is calculated from the LasDamas LRG mock catalogs; $\sigma_{\xi(r_3),i}$ and $\sigma_{Q(i)}$ are the uncertainties in $\xi(r_3)_i$ and $Q(i)$, respectively, and we assign them the error bars from the mocks. We follow the suggestion of Gaztañaga & Scoccimarro (2005) and use only the modes whose eigenvalues satisfy $\lambda \gtrsim \sqrt{2/N_b}$. As mentioned above, we are actually more limited by the number of mocks that we use: we have to use a number of modes smaller than the number of mocks.
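A minimal sketch of how the fitting statistic is assembled: the normalized residuals of the 2PCF and of the reduced 3PCF are concatenated into a single vector, and $\chi^2$ is the quadratic form of that vector with the inverse of the normalized covariance matrix. The mode truncation by Singular Value Decomposition is not implemented here; `inv_corr` is assumed to be supplied already inverted, and all names are hypothetical.

```python
def residual_vector(xi_obs, xi_model, xi_err, q_obs, q_model, q_err):
    """Concatenate normalized residuals: 2PCF entries first, then 3PCF."""
    delta = [(o - m) / e for o, m, e in zip(xi_obs, xi_model, xi_err)]
    delta += [(o - m) / e for o, m, e in zip(q_obs, q_model, q_err)]
    return delta

def chi_square(delta, inv_corr):
    """Quadratic form sum_ij delta_i * inv_corr[i][j] * delta_j.

    inv_corr is assumed to be the (pseudo-)inverse of the normalized
    covariance matrix, computed elsewhere from the retained modes.
    """
    n = len(delta)
    if len(inv_corr) != n or any(len(row) != n for row in inv_corr):
        raise ValueError("inv_corr must be a square matrix matching delta")
    return sum(
        delta[i] * inv_corr[i][j] * delta[j]
        for i in range(n)
        for j in range(n)
    )

if __name__ == "__main__":
    # One 2PCF bin and one 3PCF bin with made-up numbers; uncorrelated errors.
    delta = residual_vector([1.0], [0.5], [0.5], [2.0], [1.0], [0.5])
    identity = [[1.0, 0.0], [0.0, 1.0]]
    print(chi_square(delta, identity))
```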

The results are shown in Figures 11 and 12, where we use the covariance matrix from the 160 LasDamas mocks for each sample to constrain $c_1$, $c_2$, and $\sigma_8$. In Figure 11 we show the results for the DR7-Dim sample, and in Figure 12 the results for the DR7-Bright sample. On the top panels of each figure we show constraints on the bias parameters and $\sigma_8$ for two sets of configurations: solid lines use triangles with $s \geq 10$ $h^{-1}$ Mpc and dotted lines use only the largest triangles, $s \geq 15$ $h^{-1}$ Mpc. The contours correspond to $\Delta\chi^2$ = 1.0, 2.3 and 6.2 (corresponding to the 1$\sigma$ limit for a 1-parameter fit, the 1$\sigma$ limit for a 2-parameter fit, and the 2$\sigma$ limit for 2 parameters, respectively).
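The contour levels quoted above follow from the $\chi^2$ distribution with one or two degrees of freedom, under the usual assumption of a Gaussian likelihood. The sketch below reproduces them using only the standard library; the function names are illustrative.

```python
import math

def enclosed_probability(n_sigma):
    """Probability within +/- n_sigma of a one-dimensional Gaussian."""
    return math.erf(n_sigma / math.sqrt(2.0))

def delta_chi2_one_param(n_sigma):
    """Delta chi^2 level for one parameter of interest."""
    return float(n_sigma) ** 2

def delta_chi2_two_params(n_sigma):
    """Delta chi^2 level for two jointly constrained parameters.

    Uses the chi^2 CDF with two degrees of freedom, 1 - exp(-x / 2).
    """
    p = enclosed_probability(n_sigma)
    return -2.0 * math.log(1.0 - p)

if __name__ == "__main__":
    print(round(delta_chi2_one_param(1), 2))   # 1-sigma, one parameter
    print(round(delta_chi2_two_params(1), 2))  # 1-sigma, two parameters
    print(round(delta_chi2_two_params(2), 2))  # 2-sigma, two parameters
```

These evaluate to approximately 1.0, 2.30 and 6.18, matching the three contour levels used in the figures.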

We can see that in both samples a zero non-linear bias is excluded at 2$\sigma$ significance. As expected, DR7-Dim galaxies have a lower linear bias $c_1$ than the DR7-Bright galaxies, and there is a bigger difference in the non-linear bias parameter $c_2$. Using different sets of triangles affects not only the area of the confidence contours, but also the best-fit results; going to smaller scales moves us toward the non-linear regime, where the relations in eqs. (11) and (12) are no longer valid. Observe that the error bars are not small: this is an effect of using a large number of configurations when we have a limited number of realizations (this makes the covariance matrix nearly singular, so we are obliged to cut many modes), and also of the fact that our errors increase on large scales.

Fig. 13. — Best-fit bias parameter constraints using correlation functions separately and with fixed $\sigma_8$. Left plots show bias constraints using the 2PCF and 3PCF when fixing $\sigma_8$; right plots show, in solid straight lines, the best fit for $c_1$ using only the 2PCF when $\sigma_8$ is fixed, and the ellipses show constraints on $c_1$ and $c_2$ using the 3PCF only. Upper panels show the constraints for the DR7-Dim sample. Lower panels show constraints for the DR7-Bright sample. Contour lines represent $\Delta\chi^2$ = 1.0, 2.3 and 6.2.

With respect to the constraints on $\sigma_8$, in general they agree with the WMAP5 results (Komatsu et al. 2009), but they are more dependent on which configurations are used.

The best-fit values and 1$\sigma$ errors are also presented in Table 2.

In the lower two panels we show the LRG 3PCF along with the biased dark matter 3PCF, using the best-fit bias values mentioned above. For the DR7-Dim sample the fits are quite good, and there are no significant differences between the fits using different triangles (solid lines correspond to fits using $s \geq 10$ $h^{-1}$ Mpc triangles, whereas dotted lines use only $s \geq 15$ $h^{-1}$ Mpc triangles). For the brighter sample, the fits are poorer for configurations with $\theta > 100^\circ$.

We explore how the constraints on the bias parameters change if $\sigma_8$ is fixed (instead of being marginalized), and compare results to the analytical bias prediction from the HOD models. In Figure 13 we show these results, using only $s \geq 15$ $h^{-1}$ Mpc configurations. On the left side, the vertical ellipses correspond to constraints on the bias parameters using both the 2PCF and 3PCF, using a model where $\sigma_8$ is fixed. As expected, a lower $\sigma_8$ means a higher $c_1$. The constraints on $c_1$ are tight, due to the lower errors in $\xi$, leaving mostly the uncertainty on $c_2 = b_2/b_1$.

We compare these bias parameters with those expected from the analytical predictions of the best-fit HOD from Zheng et al. (2009, using the projected 2PCF only) for both linear and nonlinear bias in the last row of Table 2, and we find good agreement. Our error bars, in the case of $\sigma_8 = 0.8$, confirm that a zero non-linear bias parameter is rejected at the 2$\sigma$ level. On the right side, we show the bias constraints using the 3PCF only. As can be seen from the results shown in Table ??, there is a slight disagreement in the best-fit values using only the 3PCF vs. using both the 2PCF and 3PCF.

5.2. LRG bias in real space 

We explore here the validity of using redshift-space 
correlations to calculate galaxy bias parameters. When 
mapping the galaxies in the sky, we use the redshift of 
the galaxy as an indicator of distance. But since galaxies 
(and the dark matter) are subject to gravitational forces 
from the surrounding structures, peculiar velocities are 
added to the cosmological redshift, distorting our mapping of the large scale structure, and as a consequence the correlation functions are affected as well (see Hamilton 1998 for a review).

In the case of the 3PCF, there are differences between the real and redshift space 3PCF of galaxies and dark matter (e.g., Gaztañaga & Scoccimarro 2005; Marin et al. 2008), and they will affect bias parameters as well. Marin et al. (2008), using mock galaxy catalogs and $s \sim 10$ $h^{-1}$ Mpc configurations, found good but not perfect agreement between the bias parameters obtained from real and redshift space correlations in the $L_*$ range, and poorer agreement for brighter galaxies. That was caused by a combination of studying the bias on smaller scales (in the mildly non-linear regime) and the small size of the box (only 200 $h^{-1}$ Mpc on a side) for studying these galaxies, which have a low spatial density.

As shown in Figure 8, on large scales (with $s \geq 15$ $h^{-1}$ Mpc), for the mocks, the difference between the redshift and real space 3PCF is small and falls within the errors. In the case of the dark matter, there are differences, but because of the large bias, the mocks indicate that the LRG 3PCF will not show significant differences between the redshift and real space measurements on large scales.

In Figure 14 we show the differences in the correlation functions of dark matter and mock LRG galaxies from the N-body simulations by Sabiu et al. (2010), and how the constraints on the bias parameters and $\sigma_8$ change between real and redshift space correlations. Note that on small scales there are significant differences between redshift and real space 3PCFs, but on larger scales they are very similar. We have to keep in mind that these similarities occur for this binning scheme, and might not necessarily hold with different 3PCF resolutions. For the 2PCF, the differences are small, but significant. For the bias constraints, note that real space measurements have a higher linear and non-linear bias, and usually the best-fit value of $\sigma_8$ is lower in the real space fits.

TABLE 2

Best-fit bias parameters in redshift space

Fig. 14. — Redshift distortions in the 2PCF, 3PCF and in bias constraints. Top panels: 3PCF for different configurations, for dark matter in real (thick black solid line) and redshift space (thick red dashed line), and for HOD mocks in real (thin black line, with error bars) and redshift space (thin dashed line). Bottom left: 2PCF for dark matter and mocks in real and redshift space. Bottom middle and right: constraints on bias parameters $c_1$ and $c_2$, and on $\sigma_8$, in real and redshift space. The contours are $\Delta\chi^2$ = 1.0, 2.3 and 6.2.

Fig. 15. — Marginalized bias parameters and $\sigma_8$ confidence intervals from the joint 2PCF-3PCF fit for LRGs in real space for the DR7-Dim (top) and DR7-Bright (bottom) LRG samples, using the prescription described in §5.2. Only $s \geq 15$ $h^{-1}$ Mpc configurations are used. The contours are $\Delta\chi^2$ = 1.0, 2.3, and 6.2.

There have been analytical attempts to relate the real and redshift space 3-point function in the case of the bispectrum (Scoccimarro et al. 1999; Smith et al. 2008), but there are practically no attempts of this kind in configuration space, since non-linearities and mixing of large and small-scale modes make the calculations very complicated unless one takes very simplified cases or does not take full advantage of the shape dependence of the 3-point statistics (e.g., Pan & Szapudi 2005; Sefusatti et al. 2006). In our case, we take an empirical approach.

Using the similarity between the mock LRG 3PCF in real and redshift space on large scales, we estimate constraints on the bias and $\sigma_8$ parameters using the real space dark matter correlations, identifying the real space 3PCF with the redshift space 3PCF. In the case of the 2PCF, since the differences between real and redshift space correlations are significant, we use the first-order linear approximation for redshift distortions (Kaiser 1987). Following Pan & Szapudi (2005):

$$\xi_{gal}^{s\text{-space}} = \left( 1 + \frac{2}{3} f + \frac{1}{5} f^2 \right) c_1^2 \left( \frac{\sigma_8}{0.8} \right)^2 \xi_{dm}^{r\text{-space}} , \tag{16}$$

where $f = \Omega_m^{0.6}/c_1$; we tested this prescription on our HOD mocks and it works well on the largest scales, $r > 10$ $h^{-1}$ Mpc. The results are shown in Figure 15 for both the LRG-Dim and LRG-Bright samples, using triangles with $s \geq 15$ $h^{-1}$ Mpc. In general we see the same trend that was seen in Figure 14: both $c_1$ and $c_2$ are higher in real space, and we get reasonable constraints on $\sigma_8$. We notice that here the constraints are in good agreement with the values obtained from the HOD fit of Zheng et al. (2009) (shown in Table 2).
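The linear (Kaiser 1987) correction applied to the 2PCF can be sketched as below: the real-space galaxy 2PCF is multiplied by $1 + 2f/3 + f^2/5$ with $f = \Omega_m^{0.6}/c_1$. The value of $\Omega_m$ in the usage example, the default fiducial $\sigma_8 = 0.8$, and the function names are assumptions made for illustration only.

```python
def kaiser_boost(omega_m, c1):
    """Linear redshift-space boost of the 2PCF monopole.

    f = omega_m**0.6 / c1, as in the text; the boost is 1 + 2f/3 + f^2/5.
    """
    f = omega_m ** 0.6 / c1
    return 1.0 + 2.0 * f / 3.0 + f * f / 5.0

def xi_gal_redshift_space(xi_dm_real, c1, sigma8, omega_m, sigma8_fid=0.8):
    """Redshift-space galaxy 2PCF from the real-space dark matter 2PCF.

    sigma8_fid = 0.8 is an assumed fiducial normalization.
    """
    real_space = c1 ** 2 * (sigma8 / sigma8_fid) ** 2 * xi_dm_real
    return kaiser_boost(omega_m, c1) * real_space

if __name__ == "__main__":
    # Illustrative numbers only: omega_m = 0.25 is an assumed value.
    for c1 in (1.0, 2.0, 2.5):
        print(c1, round(kaiser_boost(0.25, c1), 3))
```

The boost approaches unity as $c_1$ grows, which is consistent with the statement that highly biased tracers such as LRGs show comparatively small linear redshift distortions in the 2PCF.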

5.3. HOD constraints from large-scale bias 

It is possible to use the constraints on the bias parameters to estimate best-fit values of the LRG HOD parameters. In Appendix A we show how the large-scale bias parameters can be extracted from the HOD; only the average of the HOD function, $\langle N(M) \rangle$, is needed to obtain the bias parameters $b_1$ and $b_2$. Given the uncertainties in $c_1$ and $c_2$ from the 3PCF, and for a fixed $\sigma_8$, it is possible in principle to obtain constraints on HOD parameters (Sefusatti & Scoccimarro 2005) from large-scale bias parameters. We exemplify this using the LRG DR7-Dim sample.

There are many models for the mean $\langle N(M) \rangle$, with different numbers of parameters. It is worth mentioning that even though the meaning of common HOD parameters such as $M_{min}$, $M_1$ and $\alpha$ is similar for different functional forms of $\langle N(M) \rangle$, the best-fit parameters can change significantly depending on the function used (Zheng et al. 2005). For the sake of simplicity, and to avoid marginalizing over many parameters, we use a simple model with three parameters, with a soft function for $N_{cen}(M)$ (Sefusatti et al. 2006; Kulkarni et al. 2007):

$$\langle N(M) \rangle = N_{cen}(M) + N_{sat}(M) , \tag{17}$$

$$N_{cen} = \exp(-M_{min}/M) , \tag{18}$$

$$N_{sat} = \exp(-M_{min}/M) \left( \frac{M}{M_1} \right)^{\alpha} . \tag{19}$$

We effectively constrain the HOD parameters $M_1$ and $\alpha$, since we fix the spatial density of model galaxies to the mean density of the sample, and in practice this constrains $M_{min}$.
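The three-parameter occupation model can be written down directly, as in this minimal sketch: a softly cut-off central term $\exp(-M_{min}/M)$ plus a satellite term given by the same cut-off times $(M/M_1)^\alpha$. Function names and the numbers in the usage example are illustrative; masses only need to share the same units.

```python
import math

def n_cen(mass, m_min):
    """Mean number of central galaxies: soft cut-off exp(-M_min / M)."""
    return math.exp(-m_min / mass)

def n_sat(mass, m_min, m_1, alpha):
    """Mean number of satellites: cut-off times the power law (M / M_1)^alpha."""
    return math.exp(-m_min / mass) * (mass / m_1) ** alpha

def mean_occupation(mass, m_min, m_1, alpha):
    """Mean number of galaxies in a halo of the given mass."""
    return n_cen(mass, m_min) + n_sat(mass, m_min, m_1, alpha)

if __name__ == "__main__":
    # Made-up parameter values, in arbitrary but consistent mass units.
    m_min, m_1, alpha = 1.0, 30.0, 2.0
    for mass in (0.5, 1.0, 10.0, 30.0, 100.0):
        print(mass, round(mean_occupation(mass, m_min, m_1, alpha), 4))
```

At $M = M_1$ the satellite and central terms are equal, which is why $M_1$ can be read as the typical mass of a halo hosting one satellite in addition to its central galaxy.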






TABLE 3

Best-fit bias parameters in real space

Sample       Triangles used            Fit                        c_1     c_2     σ_8
DR7-Dim      s ≥ 10 h^-1 Mpc           2PCF & 3PCF, 3-parameter   2.51±…  0.82±…  0.75±…
DR7-Dim      s ≥ 15 h^-1 Mpc           2PCF & 3PCF, 3-parameter   2.38±…  0.92±…  0.77±…
DR7-Bright   s ≥ 10 h^-1 Mpc           2PCF & 3PCF, 3-parameter   2.22±…  0.75±…  0.91±…
DR7-Bright   s ≥ 15 h^-1 Mpc           2PCF & 3PCF, 3-parameter   2.45±…  …       0.85±…




Fig. 16. — Constraints on the HOD parameters $M_1$ (in units of $10^{14}$ $h^{-1} M_\odot$) and $\alpha$ from real-space bias measurements, when triangles with $u = 2$ and $s \geq 10$ $h^{-1}$ Mpc are used. The white point is the position of the best-fit HOD parameters. The contours are $\Delta\chi^2$ = 2.3 and 6.2, from inside out.

We create a grid with different values of $M_1$ and $\alpha$, where we calculate the bias parameters $b_1 = c_1$ and $b_2 = c_1 c_2$ (see Appendix A), and then compare them with the constraints on the bias parameters $c_1$ and $c_2$ of the DR7-Dim sample when we fix $\sigma_8 = 0.8$. This gives us a likelihood for the HOD parameters $M_1$ and $\alpha$.

In Figure 16 we show the constraints on $M_1$ and $\alpha$ using the constraints on $c_1$ and $c_2$ for the DR7-Dim LRG sample, using $s \geq 15$ $h^{-1}$ Mpc configurations, for fixed $\sigma_8 = 0.8$. We notice the constraints are not very strong, especially on $\alpha$. The best-fit HOD parameters are $M_1 = 3.2 \times 10^{14}$ $h^{-1} M_\odot$ and $\alpha = 2.35$. The constraints on $M_1$ are consistent with what is expected from fits to 2-point correlations, and although our best-fit value for $\alpha$ is high, it is consistent with the findings of high $\alpha$ for the LRG HOD (Blake et al. 2008; Zheng et al. 2009), and higher than the estimates from 3PCF fits by Kulkarni et al. (2007). Note that the strongest constraints are on $M_1$, and comparing the shape of these contours with the contours of Figure 13 we can conclude that $M_1$, the "average" mass of a halo containing at least two LRGs (a central and a satellite), is mostly controlled by the linear bias $c_1$, while $\alpha$ is more sensitive to the $c_2$ constraints.

6. CONCLUSIONS AND FUTURE DIRECTIONS 

The purpose of this work was to show that the LRG 
3PCF provides a complementary way to estimate biasing 



and place constraints on cosmological models. With the 
advent of new and larger surveys, it is possible to get 
constraints comparable to other methods, in cosmology 
and galaxy population models. Our main findings are: 

• With a new binning scheme, it is possible to have a better measurement of $Q(\theta)$ and go to larger scales, making it possible to use the LRG 3PCF to learn about galaxy bias and $\sigma_8$.

• The measurement of errors in the 3PCF is non-trivial. From measuring the 3PCF both in mocks and in JK subsamples, we find that the covariance matrices have noticeable differences, and the JK method tends to overestimate the errors and covariance compared to using mocks.

• Using the redshift space 2PCF and 3PCF, we estimate constraints on the linear and non-linear bias parameters. We show that the non-linear bias is non-zero at the 2$\sigma$ level for LRGs.

• Using N-body simulations, for this binning scheme, we find that the predicted LRG 3PCF is very similar (within the errors) for real and redshift space measurements on large scales. Using this fact, we estimate bias parameters in real space, as well as $\sigma_8$, which gives results similar to the WMAP5 estimates.

• From the large-scale bias parameter constraints, we 
estimate HOD parameters. Our constraints are not 
very strong, but the best-fit values are similar to 
what is found by other methods. 

Using a new binning scheme that preserves the shape 
dependence of $Q(\theta)$, it is possible to measure the 3PCF 
on a larger range of scales with a better signal-to-noise 
than previous schemes. We treated here the statistical 
errors only, but future investigations (outside the scope 
of this particular work) need also to address the issue of 
how different resolutions, or binning schemes, can lead 
to systematic errors in the 3PCF measurements and on 
bias constraints. 

The measurement of errors shows that, provided that 
the mocks adequately describe the correlations of 
the sample being analyzed, it is much preferable to use 
them instead of JK resampling to calculate 
the covariance of 3PCF measurements. In contrast to 
the case of the 2PCF (e.g., Zehavi et al. 2005b), where 
JK resampling provides a good description of the errors, for the 
3PCF we would need much larger volumes, and even then could 
measure only on a limited range of scales, to be able to trust the JK 
error estimates. The problem is most severe for 
the covariance of the 3PCF measurements. 
Diagonal errors can be obtained 
using a small number of subsamples, $N \sim 10$, but 
a converged covariance matrix requires $N \gtrsim 100$. 
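
The two covariance estimators compared above can be sketched in a few lines. This is a minimal illustration, assuming the $Q$ measurements are stored with one row per mock (or per JK sample, each measured with one subvolume removed) and one column per bin. The function names and array layout are hypothetical and are not taken from the analysis pipeline used in this work.

```python
import numpy as np


def mock_covariance(q_mocks):
    """Sample covariance of Q from independent mock catalogs.

    q_mocks: array of shape (n_mocks, n_bins).
    """
    q = np.asarray(q_mocks, dtype=float)
    dev = q - q.mean(axis=0)
    return dev.T @ dev / (q.shape[0] - 1)


def jackknife_covariance(q_jk):
    """Delete-one jackknife covariance of Q.

    q_jk: array of shape (n_sub, n_bins); row k is Q measured with
    subvolume k removed from the survey.
    """
    q = np.asarray(q_jk, dtype=float)
    n_sub = q.shape[0]
    dev = q - q.mean(axis=0)
    # The (n_sub - 1) / n_sub factor accounts for the strong overlap
    # between the delete-one samples.
    return (n_sub - 1) / n_sub * (dev.T @ dev)
```

Either estimate built from $N$ realizations has rank at most $N - 1$, so it cannot be inverted when $N$ does not exceed the number of bins. This is one reason why the full covariance matrix needs many more realizations than the diagonal errors.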

The aspects mentioned above will affect the model con- 
straints from the 3PCF, and we will address them 
in future investigations. We showed that 
LRGs, or equivalently, high-mass galaxies, have a large 
(non-zero) positive non-linear bias, confirming 
the prediction of the halo and HOD models 
that high-mass galaxies (and halos) have large values of 
the linear and non-linear bias parameters. 

For larger scales, we take advantage of the fact that 
the LRG real space 3PCF is similar to the redshift space 
counterpart. With better measurements this approximation might 
no longer hold, especially if the purpose is to obtain 
more precise constraints on $\sigma_8$, but the constraints 
obtained here are consistent with what is expected from 
simulations and, moreover, they allow us to put complementary 
constraints on galaxy occupation models. 

The future of the LRG 3PCF looks promising: apart 
from better constraints on biasing, on small scales HOD 
fits will become relevant, and on the largest scales, con- 
straints from the BAO will also contribute to constrain- 
ing cosmological and galaxy formation models. 
APPENDIX 
A. LRG HOD AND LARGE-SCALE BIAS 
A.1. Halo Mass Function 

The distribution of dark matter halos is biased with respect to the dark matter (Press & Schechter 1974); therefore 
the galaxies, which form inside these halos, are biased tracers as well. As mentioned above, in the HOD model we proceed 
in two steps: first, we describe how dark matter halos are distributed in the universe, and second, how galaxies 
populate these halos. 

To describe the mass function of dark matter halos (i.e., the number density of halos as a function of mass) we use 
the Sheth & Tormen mass function (Sheth et al. 2001b; Sheth & Tormen 2002): 



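\begin{equation}
n_h(M) = \frac{\rho_m}{M^2} \left| \frac{d\ln\sigma}{d\ln M} \right| f(\nu), \tag{A1}
\end{equation}

where $\rho_m = \Omega_m \rho_c$ is the average matter density in the universe at the epoch of observation, $\nu = \delta_c/\sigma(M, z)$, with 
$\delta_c = 1.69$ the linear-theory threshold for collapse, and $\sigma(M, z)$ the r.m.s. amplitude of linear density fluctuations in a sphere of radius $R(M)$ at 
redshift $z$: 

\begin{equation}
\sigma^2(M, z) = \int \frac{d^3k}{(2\pi)^3} \, |W(k, R)|^2 \, P(k, z). \tag{A2}
\end{equation}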

Here $P(k, z)$ is the linear matter power spectrum and $W(k, R)$ is the Fourier transform of a top-hat window of radius 
$R$. The function $f(\nu)$ is motivated by the ellipsoidal collapse model (Sheth & Tormen 2002) and has the form 

\begin{equation}
f(\nu) = A \sqrt{\frac{2a}{\pi}} \left[ 1 + (a\nu^2)^{-p} \right] \nu \, e^{-a\nu^2/2}, \tag{A3}
\end{equation}

where $A = 0.322$ is fixed by normalization, $p = 0.3$, and $a = 0.707$. 
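
As a rough numerical illustration of equations (A2) and (A3), the sketch below evaluates the top-hat window, the resulting $\sigma(R)$ for a tabulated power spectrum, and the multiplicity function $f(\nu)$. It assumes an isotropic $P(k)$, the $(2\pi)^3$ Fourier convention, and a form of $f(\nu)$ that carries an explicit factor of $\nu$. The function names and the integration grid are hypothetical choices made for this example only.

```python
import numpy as np

# Constants quoted in the text.
DELTA_C = 1.69   # linear collapse threshold
A_NORM = 0.322   # normalization A
P_ST = 0.3       # exponent p
A_ST = 0.707     # parameter a


def trapezoid(y, x):
    """Trapezoidal rule for samples y on a (possibly uneven) grid x."""
    return np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x))


def tophat_window(k, radius):
    """Fourier transform of a real-space top hat, W(k, R), for k >= 0."""
    x = np.atleast_1d(np.asarray(k, dtype=float)) * radius
    w = 1.0 - x**2 / 10.0  # series expansion, kept only for small x
    big = x > 1e-3
    xb = x[big]
    w[big] = 3.0 * (np.sin(xb) - xb * np.cos(xb)) / xb**3
    return w


def sigma_tophat(radius, k, pk):
    """Square root of eq. (A2): d^3k / (2 pi)^3 -> k^2 dk / (2 pi^2)."""
    integrand = k**3 * pk * tophat_window(k, radius) ** 2 / (2.0 * np.pi**2)
    return np.sqrt(trapezoid(integrand, np.log(k)))  # integrate in ln k


def f_sheth_tormen(nu):
    """Multiplicity function of eq. (A3), with nu = delta_c / sigma."""
    nu = np.asarray(nu, dtype=float)
    a_nu2 = A_ST * nu**2
    return (A_NORM * np.sqrt(2.0 * A_ST / np.pi)
            * (1.0 + a_nu2 ** (-P_ST)) * nu * np.exp(-0.5 * a_nu2))
```

With these parameter values, $\int f(\nu)\, d\ln\nu$ is close to unity, which is the sense in which $A$ is set by normalization: all of the mass is assigned to halos of some mass.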

A.2. HOD bias 

Knowing the probability distribution $P(N|M)$ of how galaxies populate halos as a function of mass, we can infer the 
clustering at any scale. In the large-scale limit, knowing the mean occupation number $\langle N_g(M) \rangle$, the HOD model allows us 
to estimate the bias parameters: 

\begin{equation}
b_i = \frac{1}{n_g} \int_{M_{\rm min}}^{\infty} dM \, n_h(M) \, b_i^h(M) \, \langle N_g(M) \rangle, \tag{A4}
\end{equation}

where $i = 1, 2$ labels the linear and first non-linear bias terms, $n_h(M)$ is the dark matter halo mass function, $b_i^h(M)$ is 
the halo bias, $\langle N_g(M) \rangle$ is the mean number of galaxies in a halo of mass $M$, and $n_g$ is the galaxy number density, 
calculated as 

\begin{equation}
n_g = \int_{M_{\rm min}}^{\infty} dM \, n_h(M) \, \langle N_g(M) \rangle. \tag{A5}
\end{equation}

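The dark matter halo bias parameters are given by Scoccimarro et al. (2001): 

\begin{equation}
b_1^h(M) = 1 + \epsilon_1 + E_1, \tag{A6}
\end{equation}

\begin{equation}
b_2^h(M) = \frac{8}{21} (\epsilon_1 + E_1) + \epsilon_2 + E_2, \tag{A7}
\end{equation}

with the coefficients given by 

\begin{equation}
\epsilon_1 = \frac{a\nu^2 - 1}{\delta_c}, \tag{A8}
\end{equation}

\begin{equation}
\epsilon_2 = \frac{a\nu^2}{\delta_c^2} \left( a\nu^2 - 3 \right), \tag{A9}
\end{equation}

\begin{equation}
E_1 = \frac{2p/\delta_c}{1 + (a\nu^2)^p}, \tag{A10}
\end{equation}

\begin{equation}
\frac{E_2}{E_1} = \frac{1 + 2p}{\delta_c} + 2\epsilon_1. \tag{A11}
\end{equation}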

It is worth emphasizing that the bias relations in equation (A4) are valid only on large scales, in the quasi- 
linear regime, where they are scale independent. On smaller scales the bias becomes strongly scale-dependent and higher 
moments of the $P(N|M)$ distribution need to be known. 
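
The large-scale bias integrals of equations (A4) and (A5), together with the halo bias coefficients of equations (A6) to (A11), can be sketched numerically as follows. This is an illustration under stated assumptions rather than the code used for the fits: the mass function, $\nu(M)$, and mean occupation are assumed to be tabulated on a mass grid that starts at $M_{\rm min}$, the prefactor $8/21$ in $b_2^h$ is assumed to be $2(1 + a_2)$ with $a_2 = -17/21$, and all function names are hypothetical.

```python
import numpy as np

DELTA_C = 1.69   # linear collapse threshold, as in Appendix A.1
A_ST = 0.707     # Sheth & Tormen parameter a
P_ST = 0.3       # Sheth & Tormen parameter p


def halo_bias(nu):
    """Halo bias b_1^h and b_2^h of eqs. (A6)-(A11) as functions of nu."""
    a_nu2 = A_ST * np.asarray(nu, dtype=float) ** 2
    eps1 = (a_nu2 - 1.0) / DELTA_C
    eps2 = a_nu2 * (a_nu2 - 3.0) / DELTA_C**2
    e1 = (2.0 * P_ST / DELTA_C) / (1.0 + a_nu2**P_ST)
    e2 = e1 * ((1.0 + 2.0 * P_ST) / DELTA_C + 2.0 * eps1)
    b1 = 1.0 + eps1 + e1
    # Assumption: prefactor 8/21 = 2 (1 + a_2) with a_2 = -17/21.
    b2 = (8.0 / 21.0) * (eps1 + e1) + eps2 + e2
    return b1, b2


def trapezoid(y, x):
    """Trapezoidal rule for samples y on a (possibly uneven) grid x."""
    return np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x))


def galaxy_bias(mass, n_h, nu, mean_n):
    """Eqs. (A4)-(A5) on a tabulated mass grid starting at M_min.

    mass, n_h, nu, mean_n: 1-D arrays of equal length holding M,
    n_h(M), nu(M) and <N_g(M)>. Returns (n_g, b_1, b_2).
    """
    n_g = trapezoid(n_h * mean_n, mass)
    b1_h, b2_h = halo_bias(nu)
    b1 = trapezoid(n_h * b1_h * mean_n, mass) / n_g
    b2 = trapezoid(n_h * b2_h * mean_n, mass) / n_g
    return n_g, b1, b2
```

Because both bias parameters are occupation-weighted averages of the halo bias, a sample confined to halos of a single $\nu$ would simply inherit $b_1^h$ and $b_2^h$ at that $\nu$.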



